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Abstract. We propose a new global SPACING constraint that is useful in model- 
ing events that are distributed over time, like learning units scheduled over a study 
program or repeated patterns in music compositions. First, we investigate theoret- 
ical properties of the constraint and identify tractable special cases. We propose 
efficient DC filtering algorithms for these cases. Then, we experimentally evalu- 
ate performance of the proposed algorithms on a music composition problem and 
demonstrate that our filtering algorithms outperform the state-of-the-art approach 
for solving this problem. 



1 Introduction 



When studying a new topic, it is often better to spread the learning over a long pe- 
riod of time and to revise topics repeatedly. This "spacing effect" was first identified 
by Hermann Ebbinghaus in 1885 [1 1. It has subsequently become "one of the most 
studied phenomena in the 100-year history of learning research" [2|. It has been ob- 
served across domains (e.g. learning mathematical concepts or a foreign language, as 
well as learning a motor skill), across species (e.g. in pigeons, rats and humans), across 
age groups and individuals, and across timescales (e.g. from seconds to months). To 
enable learning software to exploit this effect, Novikoff, Kleinberg and Strogratz have 
proposed a simple mathematical model [3 1. They consider learning a sequence of edu- 
cational units, and model the spacing effect with a constraint defined by two sequences, 
A = [di , et2) • • •] and B = [bi , b%, . . .] . For each unit being taught, the i + 1st time that 
it should be reviewed is between and bi time steps after the zth time. 

This technical report, after giving a brief background in the following section, de- 
scribes and analyses the Global SPACING Constraint in the section 3. Then the sections 
4, 5, 6, 7 and 8 explore restrictions of the constraint and identify tractable special cases. 
In the sections 5 and 6 we propose efficient filtering algorithms for these tractable cases. 
In the section 9, we describe a useful application of the constraint to solving a music 
composition problem. The section 10 presents experimental evaluation of the SPACING 
constraint on the music composition problems. 



* NICTA is funded by the Australian Government as represented by the Department of Broad- 
band, Communications and the Digital Economy and the Australian Research Council. 



2 Background 



2.1 Constraint Satisfaction Problems (CSP) 

A constraint satisfaction problem consists of a set of variables, each with a finite do- 
main of values, and a set of constraints specifying allowed combinations of values for 
subsets of variables. We use capital letters for variables (e.g. X and Y), and lower 
case for values (e.g. d and d'). We write D(X) for the domain of a variable X and 
D = U"=i ^C^*) f° r me set °f a ^ th e domain values. Assigning a value d G D(X) 
to a variable X means removing all the other values from its domain. A solution is an 
assignment of values to the variables satisfying the constraints. Constraint solvers typ- 
ically explore partial assignments enforcing a local consistency property using either 
specialized or general purpose propagation (or filtering) algorithms. A support for a 
constraint C is an assignment that assigns to each variable some value from its domain 
and satisfies C. A constraint is domain consistent {DC) iff for each variable Xi, every 
value in -D(JQ) belongs to some support. 

2.2 Matching Theory 

We also give some background on matching. A bipartite graph is a graph G = (U,V,E) 
with the set of nodes partitioned between U and V such that there is no edge between 
two nodes in the same partition. A matching in a graph G is a subset of E where no 
two edges have a node in common. A maximum matching is a matching of maximum 
cardinality. Regin[4| proposed efficient propagator based on a maximum matching al- 
gorithm. 

2.3 Propositional Satisfiability (SAT) 

A Propositional Satisfiability (SAT) is a problem of finding a model of a propositional 
formula. The propositional formula is usually in Conjunctive Normal Form (CNF), 
which is a set of clauses. We consider the clauses to be sets of literals. A literal is 
either a propositional variable (i.e. p) or a negated propositional variable (i.e. -<p). The 
set of all variables from <f> is var(<f>) and -^var{4>) — {^p \ p G var(<f))}. Without loss 
of generality we can order the variables from var(4>) into sequence [pi, . . . , p v ] and the 
clauses of <p into sequence [C\, . . . , C c ]. Let lit(cf>) = var{4>) U -ivar(<f>). Then a set of 
literals / C lit(<p) is an interpretation of 4> if it is maximal and consistent, i.e. it does 
not contain a pair of complementary literals I G / — > I £ I. An interpretation / of <f> 
is a model of <j> if it contains at least one literal from each clause of (j>. We also use the 
following notation: if L is a set of literals, then L' — {I' | I G L} and L % = {V 1 \ I G L}. 

3 The Global Spacing Constraint 

First we describe the Global SPACING Constraint and then its further restrictions in the 
following sections. For simplicity, we introduce a function that returns the number of 
occurrences of a domain value d G D in a sequence of variables X — [Xi , . . . , X n ] . 
It is occ(d,X) = |{i | Xi = d, 1 < i < n}\ Now we define the Global SPACING 
Constraint as follows: 



Definition 1. Let X = [X\, . . . , X n ] be a sequence of n variables and let S C D be 
a set of domain values. Let A — [a\, . . . , ctfe-i] and B = [bi, ... , bk—i] be sequences 
of natural numbers such that di < bi for 1 < i < k — 1. Then SPACING(S, A, B, X) 
holds iff for all i s.t. 1 < i < k — 1 and for all d € S it holds that if there exists j 
s -t- 1 < j < n and Xj = d and occ(d, [Xi, . . . , Xf\) = i then there exists j' < n s.t. 
j + o-i < j' < j + bi and Xj' — d and occ(d, [Xi, . . . , Xj>]) = i + 1. 

In other words, each value d £ S either does not occur in X at all, or it occurs 
on at least k different places and the distances between the places are determined by 
the sequences A and B. The minimum distance condition forces the i + 1st occurrence 
of the value d to be no closer than places from its ith occurrence and the maximum 
distance condition forces the i + 1st occurrence to be no further than bi places from the 
ith occurrence. 

For example, suppose we need to prepare a playlist for a radio station. The SPACING 
Constraint allows us to specify that any of top ten songs is either not played at all, or 
it is played at least four times in the 360 song long playlist and it is not repeated more 
frequently than every 30 songs, but at least every 90 songs. The constraint would be im- 
posed on the sequence X — [Xi, . . . , X 360 ] of n = 360 variables. The domain values 
would represent the songs and SCfl would be the set of the top ten songs. The number 
of spaced occurrences is k = 4, so the sequences A and B would be of length k — 1 = 3. 
Overall the constraint would be specified as Spacing(5, [30, 30, 30] , [90, 90, 90] , X). 

Theorem 1. Enforcing DC on the Global SPACING Constraint is NP-hard. 

Proof. We prove this by reduction of SAT to the problem of finding a support for 
Spacing. Let <p be an arbitrary CNF with v propositional variables and c clauses. We 
will abuse the notation slightly by using literals as domain values. There is a model of (j) 
iff there is a support for the constraint SPACING^, [oti, . . . , a^-i] , ■ • • , bk-i] , X) 
with 

- S = lit(4>) 

- k = c+ 1 

- dj = 1 and bi=v + lfoxl<i<k — 1 

- X is a sequence of vc + v + c variables with domains as described below. 

If we cut the sequence X into slices v + 1 variables long and put the slices under each 
other, we obtain a table with c + 1 rows where the last cell is empty. For simplicity 
the variables will be indexed Xj t i, where j is a row number and i is a column number, 
i.e. Xj t i stands for X^_ 1 ^ v+1 - )+i . The first v columns will represent the propositional 
variables of <p, so 

- the domains of variables Xjj for 1 < i < v and 1 < j < c + 1 are {pi, ^Pi}. 

The last column represents (f>, each clause in one row, so the domains are sets of literals 
from particular clauses. Simply, 

- the domains of variables Xj tV+ i for 1 < j < c are Cj. 



For example take CNF (j> — [{-*p, q, r} , {^q, r} , {->p, ^q} , {p, q}} the equivalent 
constraint would be Spacing(5, [1, 1, 1, 1] , [4, 4, 4, 4] , X) with 5 = {p, q, r, -<p, -<q, -ir} 
and X would contain 19 variables with domains ordered into the following table: 



Note that D C S. So if a value occurs in a support for the constraint, it has to occur 
in it at least c + 1 times. Due to construction of the domains, one value can occur on at 
most 2c + 1 places (in one of the first v columns and in the last column). Each value is 
sharing c + 1 of these places with its complement, thus if a value occurs in a support, 
it occupies at least c + 1 of 2c + 1 places, so its complement does not have space to 
repeat enough times to satisfy the constraint. Hence if a value occurs in a support it will 
occupy one of the first v columns completely, because its complement cannot. 

Consequently, if we have a support for the constraint, the set of values assigned to 
the first v variables is a model of <f), because each value selected in the last column has 
to occur in it and complement of none of these values can occur in it. 

On the other hand, if we have a model of <f>, we can obtain a support for the constraint 
by assigning the literals from the model to all the variables as a values. This is always 
possible, because a model interprets all the literals and it is consistent, so there will be 
exactly one value for each of the variables in the first v columns, and a model satisfies 
each clause, so there will be some value for each variable in the last column. This will 
really be a support for the constraint, because each value that will occur in it will occur 
in each row at least once. □ 

Please, note that the proof does not make use of the condition of minimal spacing 
imposed by the sequence A. Also the sequence B is constant (bi = b i+1 for all 1 < i < 
k — 2), so the proof holds also for much simpler constraint. In fact, the proof does not 
use the condition that the values have to be repeated, it is based barely on the number of 
occurrences. The reason why SPACING is NP-hard is that there is a possibility for the 
values to either occur in the support or not. Next we will examine a case that does not 
provide such a possibility. 

4 The Forced Global Spacing Constraint (Spacing^) 

This section describes a restriction of the constraint where all the values from S are 
forced to occur in the sequence and later it analyses also restriction with the distance 
conditions relaxed. 

Definition 2. Let X = [X\, . . . , X n ] be a sequence of n variables and let S C D be 
a set of domain values. Let A — \a\, . . . , ak-i] and B = [bi, . . . , &fe_i] be sequences 



of natural numbers such that ai < bifor 1 < i < k — 1. Then SPACING/? (5, A, B, X) 
holds iff Spacing (S, A, B, X) holds and all S C {X x ,...,X n }. 
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{P, ^P} {q, -"?} {r, ^r} {^p, q, r} 
{P, -^P) {q, ""?} {r, -■r} {->q, r} 
{p, ~^p} {q, ->q} {r, -t} {^p, ->q} 
{P, ~^P} {q, -"?} {r, -t} {p, q} 
{P, ^P} {q, ""7} {r, -t> 



This means that each value v S S occurs in the sequence on at least k different places. 

Theorem 2. Enforcing DC on the Global S PACING Constraint is NP-hard. 

Proof (no minimal distance condition). We prove this by reduction of SAT to the prob- 
lem of finding a support for SPACING^?. The reduction is the same as in the proof of 
TheoremQ] except the sequence X has additional (c + 2)(v + 1) + 1 variables (in total 
(2c + 3)(u + 1) variables) with domains as follows: If we order the variables into the 
same table as before, with rows of length v + 1, the variables in the rest of the last 
column, as well as the variables in the c + 2nd row can only take a dummy value 

- the domains of variables X^+i for c + 1 < j < 2c + 3 and variables X c+ 2,i for 
1 < i < v are {0}, where 0^5 

and the first v cells of remaining c + 1 rows are again the representation of propositional 
variables, so 

- the domains of variables Xjj for 1 < i < v and c + 3 < j < 2c + 3 are again 

For example, the CNF (f> = [{^p, q 1 r} , {-ig, r} , -iq} , {p, q}] from the previ- 
ous example would be reduced to the constraint Spacing_f(S', [1, 1, 1, 1] , [4,4,4,4] , X) 
with S = {p, q, r, -^p, -^q, -ir} and X would contain 44 variables with domains ordered 
into the following table: 
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We can divide the sequence into 2 parts separated by the c + 2nd row. We will call 
the first one positive part and the second one negative part. If a value from S occurs in 
one part, it will occur at least c + 1 times in that part, because the gap of 0-s between 
the parts is longer than any so all the necessary repetitions must occur in one part. 
Also if a value d 6 S occurs in one part, its complement d cannot occur in the same 
part because of a counting argument analogous to the one in the proof of Theorem Q] 
so, while all the values from S must occur in the support, the complement d must occur 
in the other part (where d will not occur due to the same argument). 

Thanks to this, the proof of Theorem[T]holds for the positive part and the values that 
cannot occur in the positive part will occur in the negative part. It is obvious that there 
is enough space for all the values: \S\ — 2v while v values occur in the positive part 



and the other v values occur in the negative part. Hence, we can obtain a support for the 
constraint from a model of <f> and a model from a support in the same manner as in the 
proof of Theorem Q] □ 

As before, this proof does not make use of the condition on the minimal distance 
between two consecutive values from S {A is a sequence of 1-s). In fact, also if we relax 
the condition on the maximal distance between two consecutive values from S (B will 
be a sequence of n-s), the constraint is still intractable. 

Theorem 3. Enforcing DC on the Global SPACING^ Constraint with bi = n for 1 < 
i < k — 1 is NP-hard. 

Proof (no maximal distance condition). We prove this by reduction of SAT to the prob- 
lem of finding a support for SPACING. Let <\> be an arbitrary CNF with v propositional 
variables and c clauses. There is a model of <f> iff there is a support for the constraint 
Spacing f (S, [oi, . . .,a k -i] , [h, . . . A-i] , X) with 

- S = Ut{(j)) 

- k = c 

- di = 5v + 1 and bi = (7v + l)c for 1 < i < k — 1 

- X is a sequence of (7v + l)c variables with the domains as described below. 

If we organize the variables from X into a table with c rows and 7v + 1 columns, the 
first column will represent (j> in the way that 

- the domains of variables for 1 < j < c are Cj, 

following 2v columns together with the first column will represent satisfied literals 

- the domains of variables X/,i+i for 1 < i < v and 1 < j < c are {pi, 0} and 

- the domains of variables X,- j+(i+u) for 1 < i < v and 1 < j < c are {^Pi, 0}, 

following 2v columns will be a padding of 0-s 

- the domains of variables Xj i+ ( 1+2v ) for 1 < i < 2v and 1 < j < c are {0}, 
following v columns will represent unsatisfied literals 

- the domains of variables Xj t i + n + 4 V \ for 1 < i < v and 1 < j < c are {pi, ~^Pi} 
and the last 2v columns will be again a padding of 0-s. 

- the domains of variables X 3 i+ ( 1+5lJ ) for 1 < i < 2v and 1 < j < c are {0}. 

For example, the CNF from our running example would be reduced to the con- 
straint Spacing f (5, [16, 16, 16] , [88, 88, 88] , X) with S = {p, q, r, ~^p, ~^q, ->r} and 
X would contain 88 variables with domains ordered into the following table: 
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We can split the table into two parts. We will address the first 2v + 1 columns as 
a positive part and the 4u + 2nd to 5v + 1-st column as a negative part. (The rest is 
padding of 0-s.) In order for an assignment to X to be a support, the following must 
hold: 

- A value d G S can occur at most once per row because of the minimal distance 
condition imposed by the sequence A. While the number of rows is k, any value 
d E S has to occur in each row, so all values from S have to occur in each row 
exactly once. 

- Two complementary values d,d G S cannot occur in the same part of any row. For 
the negative part this is due to construction of the domains and if both occurred in 
the positive part, none of them would be able to occur in the negative part of the 
same row, so there would be a variable with empty domain, which is a contradiction. 

- If a value from S occurs in one part of some row, it will occur in the same part in 
each row. This is because of the following: 

• If a value d G S occurs in the negative part of a row number j, it cannot occur 
in the positive part of the following row j + 1, because of the minimal distance 
condition, so it will occur in the negative part of the row j + 1. 

• Suppose a value d G S occurs in the positive part of a row number j. Then 
the complementary value d has to occur in the negative part of the row j. Now 
if j ^ k and if the value d did not occur in the positive part of the following 
row j + 1, then it would have to occur in the negative part. Then, however, 
the complementary value d would not be able to occur in that negative part of 
the row j + 1, which is contradiction with the above. So d has to occur in the 
positive part of the row j + 1. 

• Each value from S has to occur in the first row. So if a value from S occurred 
in some row in a different part as in the first row, we would reach contradiction 
by applying the above. 

Consequently, if we have a support of the constraint, the assignment to the first 
2v+l variables (the positive part) represents a model of <p. The set of the values assigned 
to this variables (without the value 0) is clearly an interpretation, because it cannot 
contain complementary literals. Some value is selected for each variable representing 
clause (first column) and this value has to occur in the positive part of each row and its 
complement cannot occur in the positive part, hence the interpretation is also a model, 
because it contains at least one literal from each clause. 

We can obtain a support of the constraint from a model I of <j> by assigning a value 
that represents a literal that is satisfied by / to the variables in the first column. This 
is always possible, because / must satisfy at least one literal from each clause. The 
assignment to the variables in the positive part will be as follows: (1) The value is 
assigned to the variable with domain containing value already assigned to the first vari- 
able in the row. (2) The literals satisfied by I are assigned to the rest of the variables 
as values. (3) Where this is not possible, is assigned. This is always possible, because 
each literal that may occur in 4> is represented by some value from the domains of these 
variables and no two representations of two different literals occur in a domain of one 
variable. Further, we assign the variables in the negative part so that they contain rep- 
resentations of each literal that is falsified by /. Such an assignment is always possible 



because: (1) Each of these variables will be assigned, because / falsifies either pi or 
— for each 1 < i < v. (2) And there will be at most one possible value for each 
variable, because / is consistent. We assign to the rest of the variables (their domains 
are {0}). Now the assignment is a support of the constraint, because each value d£S 
occurs exactly once in each row which is c = k occurrences. Further, due to the fact 
that the representations of the satisfied literals occur always in the positive part and the 
representations of the falsified literals occur always in the negative part, there is always 
at least 5v places between two successive occurrences of each d 6 S. In details, the 
last possible occurrence of a representation of a satisfied literal in the jth row is on the 
position (j — l)(7v + l) + (2v + 1) and the next possible occurrence of a representation 
of a satisfied literal is at the beginning of the following row (J + l — 1) (7v + 1) + 1. The 
difference of these two positions is 5v + 1 > 5v. The last possible occurrence of a repre- 
sentation of a falsified literal in the jth row is on the position (j — l)(7v + 1) + (5v + 1) 
and the next possible occurrence of a representation of a falsified literal is at the posi- 
tion (j + l — l)(7v + 1) + (4u + 2) of the following row. The difference of these two 
positions is 6i? + 2 > 5v. □ 

Another intractable restriction of the constraint is when all values from S are forced 
to occur on the first p < n places, e.g. in the reduction from the last proof it was the 
first row. The following corollary summarizes all found intractable restrictions 

Corollary 1. Enforcing DC on the SPACING constraint is NP-hard even if any combi- 
nation of the following not containing (3) and (4) simultaneously holds: 

(1) All the values from S must occur in the first p < n places of the sequence. 

(2) A and B are constant sequences, i.e. a% = . . . = and b\ = . . . = b^. 

(3) There is no minimal distance condition {ai = I for 1 < i < k — 1). 

(4) There is no maximal distance condition (bi = nfor 1 < i < k — I). 

Proof. The proof of Theorem [3] holds for all combinations of the restrictions not in- 
cluding (3) and the proof of Theorem [2] holds for all combinations of the restrictions 
not including (4). □ 



5 Bounded Size of S 



We identified two useful restrictions of the SPACING constraint that allow polynomial 
time DC filtering algorithms. The first restriction bounds the size of S, \S\ = 0(1). It 
can be used to model education process where the number of learning units is naturally 
bounded. 

Theorem 4. Enforcing DC on the Spacing(5, A, B, X) constraint can be done in 
0(n^ +2 \S\) time. 

Proof. We can define an automaton for accepting sequences satisfying the SPACING 
constraint. The states of the automaton just need to keep count of the number of steps 
since the last occurrence of each value in S. There are 0(rJ' s l) possible states in this 
automaton, which is polynomial for \S\ = 0(1). □ 



6 The One Voice Global Spacing Constraint (SpacingI ) 



The second tractable restriction of the S PACING constraint ensures that all values from 
S occur in the first period of length p and they must repeat in the successive k—1 periods 
on the same places. In other words, the first period of length p is cycled k times. This 
restriction is useful in music composition problems [6|, where the composer wants to 
generate one voice consisting of a p beat long rhythmical pattern that is played k times. 
The pattern consists of |5| onsets (beginnings of notes) that must be played exactly k 
times in the whole voice. This can be encoded using a restriction of SPACING that is 
defined as follows: 

Definition 3. Let X = [X\, . . . , X n ] be a sequence of n variables and let S C D be 
a set of domain values. Let p and k be natural numbers such that p < n and pk < n. 
Then SPACINGI (S,p, k, X) holds iff SPACING^, A, B, X) with S C {Xl, . . . ,X p }, 
a, = hi = pfor 1 < i < k — 1 and \ {j \ Xj = d,l < j < n}\ = kfor all d G S holds. 

Theorem 5. For any constraint SpacingI (S, p, k, X), there is a bipartite graph G = 
(U, V, E) such that there is a support for the constraint iff there is a maximum matching 
in G. Enforcing DC on the constraint takes 0(p 2 k + p 2 5 ) time down a branch of the 
search tree. 

Proof. First, we observe that values D \ S are interchangeable as we do not distinguish 
between values d outside S,d ^ S. Therefore, we perform channeling of variables X to 
variables Y and map all values outside of S into a dummy value 0: Xj 6 S -H- Yi = X* 
and Xi $ S O Yi — for 1 < i < n, where ^ S is a fresh value. 

Second, we exploit the special structure of S PACING 1. Namely, variables in posi- 
tions i.p + i, . . . , (k — l)p + i, 1 < i < p, must take the same value. Hence, to check 
whether value d can be assigned to one of the variables Yi, Y p+ i, . . . , Y^-i) p +i, we 
need to check whether d € Dj=o ^(Xjp+i)- ^ e use a folding procedure to identify 
possible positions for each value. We fold the domains of Y into Pi = C\j=o ^C^j'p-h) 
for 1 < i < p. 

Finally, we need to match values S U {0} with the positions in one period. To avoid 
using generalized matching, we introduce p — \S\ copies of the dummy value, otherwise 
we would have to match with p — \S\ nodes. 

Next we describe construction of the graph G. The sets of nodes are U = S U 
{0j | 1 < j < p - \S\}, V = {1, . . . ,p}. The set of edges is E = {(d,i) | d € 
Pi n S, 1 < i < p} U {(0j,i) | 1 < j < p - \S\,0 e Pi, 1 < i < p}. Now we show 
that there exists a support for S PACING 1 iff there exists a maximum matching in G. 

Having a support for the constraint, we can obtain a subgraph of G with the same 
sets of nodes and the set of edges M that consists of two parts: (1) the edges between 
the places of the variables in the first period and the values from S that are assigned to 
them in the support M\ — {(d, i) \ d = Xi, d € S, 1 < i < p], (2) if we order the 
rest of the places in the first period (that contain values not in S) into a new sequence 
N = [i\Xi ^ S, 1 < i < p] (note that \N\ — p — \S\), the edges between these places 
and the dummy values with the same index M% = {(0j,Nj) | 1 < j < p — \S\}. 
So M — Mi U A'h. Now we have indeed a subgraph of G, because M is a subset 



of E, because, while we have a support, values from S have to repeat on the same 
places in each period, so Xi 6 Pi if Xi S S and € Pi if Xi £ S for 1 < i < p, thus 
Mi C {(d, i) I d € PiDS, 1 < i < p} and M 2 C {(Oj, i) | 1 < j < p- 6 P t , 1 < 
i < p}. M is also a matching in G, because exactly one value is assigned to each 
variable and each dummy value is connected to exactly one place. M is a maximum 
matching, because it contains all the nodes, because each value from S have to be 
assigned to some variable in the first period and the number of the dummy values is 
exactly the number of variables in the first period with values not in S assigned to them. 
Hence, we obtained a maximum matching in G from a support for the constraint. 

Now having a maximum matching M in G, we can obtain a support for the con- 
straint by assigning the values from S to the places in each period to which they are 
matched by M and any value not in S to the places in each period to which a dummy 
value is matched by M. This is a valid assignment, because M matches something with 
each place, because it is maximal. This assignment is always possible, because G con- 
nects each d 6 S only to places of variables with domains containing d in each period 
and G connects dummy values only to places of variables with domains containing a 
value not in S in each period. The assignment is indeed a support, because each value 
from S is matched with exactly one place of a period and repeats altogether k times on 
the same place in each period. 

Our DC propagator is based on DC propagator for AllDifferent by Regin[4|. 
First, we determine the set of edges that do not belong to any maximum matching the 
same way as propagator for AllDifferent enforces DC. This takes 0(p 25 ) down 
a branch of the search tree. If an edge (u, i) does not belong to any matching then the 
value u can be removed from domains of variables Yi , Y p +i , . . . , Ytk- i)p+i which takes 
O(k) time. There can be at most 0(p 2 ) removals down a branch, so overall complexity 
down a branch of the search tree is 0(p 2 k + p 2 ' 5 ). □ 

For example, consider the SPACING 1 constraint with D = {a,b, c,o},S — {a,b,c} 
representing the onsets, p = 5 length of the pattern and k = 3 number of repetitions on 
the sequence of 15 variables. The variable domains are as shown below: 
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During propagation, channeling will simply replaces o with 0. Then the folded domains 
will look like this: 
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And finally, the bipartite graph is given in Figure Q] There are two maximum matchings 

in this graph: {(c, 3), (a, 2), (b, 4), (0i, 1), (0 2 , 5)} and {(c, 3), (a, 2), (6, 4), (0 2 , 1), (0i, 5)}. 

This means that the only support is [o, a, c, b, o, o, a, c, b, o, o, a, c, b, o]. 




, . . . , Xis]) from the example 



7 The h- Voice Global Spacing Constraint (Spacing^) 

A composer would usually want to compose more voices playing at the same time with 
no overlapping onsets. The h- Voice Global SPACING Constraint, which is simply a 
conjunction of more SpacingI constraints on the same sequence of variables, can be 
used for this purpose. 

Definition 4 (Spacing^). Let X = [X\ , . . . , X n ] be a sequence of n variables and let 
S = [Si, ... , Sh] be a sequence ofpairwise disjoint sets of domain values (Ui=i &l — 
D and H Si 2 = for all 1 < l\ < l 2 < h). Let [pi, . . . ,Ph] be a sequence 
of natural numbers and let k be a natural number such that p\k < n for all 1 < 
I < h. Then SPACING^(S, [pi, . . . ,ph\ , k, X) holds iff a conjunction of constraints 
SpacingI (Si,pi,k, X) for all! <l <h holds. 

Theorem 6. Enforcing DC on the Global SPACING^ Constraint is NP-hard even for 
h = 2. 

Proof. We prove this by reduction of SAT to the problem of finding a support for 
SPAClNG/j. The main idea of the proof is similar to the other hardness proofs: The 
alternative choice of a literal satisfying a clause is modeled by choice of a value for a 
variable and mutual exclusion of complementary literals is enforced by properties of 
Spacing. For the Spacing/j constraint it is mutual exclusion of values from the same 
variable domain (assignment of one excludes the others). The system how choosing 
two complementary literals to be true leads to the mutual exclusion of values is quite 
complicated, because we have little freedom (just two voices). The idea is that the lit- 
eral chosen to satisfy a clause is copied to the part representing the model. This is done 
by the impossibility of having more than one occurrence of the same value in one pe- 
riod of a voice, so from domains of cardinality 2 containing this value the other one 
has to be chosen. These other values (that represent the model) belong, however, to the 
other voice, which has period of different length, so repetitions of these values will be 
aligned with different places of the other periods of the first voice. On the one hand, this 
is used for copying the clause satisfiers to the same model representation, on the other 
hand, it is used to align values for complementary literals with each other so that their 
assignment mutually exclude each other. 

Now we describe the reduction in detail. Please, recall that if L is a set of literals, 
then L' = {V \ I e L} and V = {l l \ I G L}. Let 4> be an arbitrary CNF with v 
propositional variables and c clauses. There is a model of <fi iff there is a support for the 
constraint SPACING/^ [Si, S 2 ] , [pi,P2] , k, X) where 



- Si = Ht^y u \j. =1 (ut{<f>y) 

- S 2 = Ut((p) 

- pi = c + 6cv 

- P2 = c + 6cv + 2v 

- k = c 

- X is a sequence of (c + 6cv + 2v)k variables with the domains as described below. 

We will describe the domains voice by voice, so ultimately the domains are minimal 
sets satisfying the following conditions. The period of the first voice is p\ = c + Qcv, let 
us split the sequence into rows of this length and order them into a table (one row is one 
period of the first voice). Let Xj i denote X(j_ 1 * )pi+i . Then the first c columns represent 
<f> in the following fashion: The cells on the main diagonal represent the clauses and the 
other cells are filled with representation of all the literals, 

- the domains of variables Xj t for 1 < j,i < c contain {p 1 \ p e Cj} if i = j, 
otherwise lit((j)) 1 . 

The following Acv columns contain representations of all literals for each clause two 
times, 

- the domains of variables X 1 , , w , . for 1 < j, i < c, < y < 1 and 
1 < x < v contain p l x if y = and -^p l x if y = 1, 

- the domains of variables X 1 o {c+2cv)+{i _ 1)2v+vy+x for 1 < j, i < c, < y < 1 and 
1 < x < v contain p x if y = and -ip l x if y = 1. 

The last 2cv columns contain representations of all literals in the usual order in their 
first 2v columns, in their last 2v in the order where positive and negative literals are 
swapped, and in the rest of their columns, 

- the domains of variables X 1 , , , % . for 1 < j < c and 1 < x < v contain p' 

],(c+4cv)+x — j — — — fx' 

- the domains of variables X 1 < ■, . . for 1 < j < c and 1 < x < v contain 

- the domains of variables Xj ^ c+icv - )+2v+x for 1 < j < c and 1 < x < 2cv — Av 
contain 0, 

- the domains of variables Xj {c+Acv)+{2cv _ 2v)+x for 1 < j < c and 1 < x < v 
contain ~^p' x , 

- the domains of variables Xj ( c+icv)+{2cv _ v)+x for 1 < j < c and 1 < x < v 
contain p' x . 

(This is not well defined for c = 1, but cj) is trivially satisfiable in this case.) The 
arrangement of the values of the second voice is much more simple. The period of 
the second voice is pi = c + Qcv + 2v, so let X"j i denote X(j_i} P2+i . Then places 
from c + 1 to c + 2v and from c + Acv + 1 to c + Acv + 2v of each period contain 
representations of all literals in the usual order, 

- the domains of variables X? i c \ +x for 1 < j < c and 1 < x < v contain p x , 

- the domains of variables X"* , c+v s +x for 1 < j < c and 1 < x < v contain ^p x , 



- the domains of variables X? ^ c+icv ^ +x for 1 < j < c and 1 < x < v contain p x , 

- the domains of variables Xj ^ c+icv+v ^ +x for 1 < j < c and 1 < x < v contain 

-'Px- 

Finally, the domains of variables that are not on the first c places of any period of the 
first voice and do not contain a value from S% contain 0: 

- the domains of variables Xi for i ^ {(j — l)pi + x | 1 < j, x < c} U {(j — l)p2 + 
c+x I 1 < j < c, l<x< 2v}{J{(j-l)p 2 + c+4cu+x 1 1 < j < c,l < x < 2v} 
contain 0. 

The constraint for the CNF from our running example would be imposed on a se- 
quence of variables with domains displayed in the TableQ] The last 2vk domains of {0} 
are not displayed in the table. 

Let us organize the sequence X into a table with c + 6cv columns again (using Xj i 
notation) and ignore the last 2cv variables with domains {0}. Each row belongs to one 
clause. Also there is one set of literal representing values for each clause (lit(4>y for 
Cj). Clauses form domains on the main diagonal using their sets of values. The rest 
of the domains in the first c columns are constructed to enable repetitions of values 
selected for the clause variables. The next 2cv columns are called positive part, because 
the values from S2 it contains will represent a model of <fi. Due to the difference between 
periods, in each row j these values from S2 are aligned with the values of the clause of 
the row (values from lit((f>y). Thanks to this, when a value <P is assigned to the variable 
of clause Cj, d? cannot occur in the rest of the row, thus also not in the positive part, so 
the appropriate value d has to be selected in this part. This is why, having a solution of 
the constraint, at least one literal from each clause will be assigned as a value from S2 
to some variable in the positive part. 

The last 2cv columns are consistency part, because they ensure the consistency of 
the model represented by a solution. When a value d G S2 is assigned to a variable in 
the positive part, it has to be repeated in the positive part in each row. Also it cannot 
be repeated in the rest of any row. Hence a corresponding primed value d' G Si must 
be assigned to one of the first 2v places of the consistency part of the first row (and 
also each other row), thus d 1 cannot repeat in the last 2v places of the consistency 
part in any row, mainly not in the last one. Please note that the positive and negative 
primed values are switched in the last 2v places, so that, in the last row, the negative 
primed values are aligned with the positive values from S2 and the positive ones are 
aligned with the negative ones. This whole construction of the domains causes that if 
two complementary literal values d, d G S2 are assigned to variables in the positive 
part, they cannot be assigned to the first 2v variables in the first row of the consistency 
part, so both d',d' G Si have to be assigned in these first 2v variables and cannot 
be assigned to the last 2v variables in the last row of the consistency part. However, 
the same assignment of values from S2 must be repeated in each row of this part, so 
two variables in the last 2v places of the last row would be left with empty domains. 
Analogous reasoning rules out the case when none of d, d G S2 is assigned in the 
positive part. Also it is easy to see that when exactly one of d, d G S2 is assigned 
in the positive part, no contradiction is reached. This shows that, having a solution of 
the constraint, the set of literals assigned as a values from S2 in the positive part is 



Table 1. Example for reduction from SAT to SPACING,,. 
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consistent. So, together with the result above, having a solution of the constraint, the set 
of literals assigned as a values from S2 in the positive part is a model of <j>. 

We can obtain a solution of the constraint from a model of 4> in the following way: 
We assign the literals from the model as a values of the second voice to the variables 
in the positive part. Then we choose a value that represents some literal satisfied by the 
model for the clause variables. And assign the rest of the variables so that the constraint 
holds. It is always possible to do this. It is always possible to assign the literals from 
the model to the positive part, because there is one variable for each literal in each row 
of the part and these variables repeat with the period of the second voice (where the 
values are from). It is always possible to complete the assignment to the positive part, 
because each value has an alternative in each domain and each value can be repeated 
with it's period. As long as the model is consistent, no contradiction can be reached in 
the consistency part because of the following: For any pair of complementary literals 
d,d € S2, one of them is assigned in the positive part and the other one is not. The 
selection of the values from S2 in the consistency part is reversed with comparison to 
the positive part, so the mutual exclusion still holds. The selection of the primed values 
in the first 2v columns of the consistency part will represent the model again and the 
selection of the primed values in the last 2v columns will be reversed again. Due to the 
fact that the positive values are swapped for the negative ones in the last 2v columns, 
assignment of the primed values complements the assignment of values from S2 in the 
first as well in the last 2v columns and no domains are emptied. It is always possible 
to select some value for the clause variables because of the following: In the positive 
part of each row j, the selection of the values indexed with j will be complementary to 
the selection of the values from the model. This leaves the possibility for the indexed 
values representing literals satisfied by the model to be selected elsewhere in the row, 
for instance in the clause variable. Each clause variable always contains at lease one 
value representing literal satisfied by the model. Assignment to the clause variables can 
be repeated in each row and the primed values that could not be selected in the positive 
part nor in the clause values can still be assigned to some of the variables in the columns 
between c + 2cv + 1 and c + Acv called negative part. □ 

8 Incomplete filtering of Spacing^ 

Since already a conjunction of two S PACING 1 constraints is NP-hard, we decided to de- 
vise an incomplete additional rule that facilitates propagation between two SPAClNGl 
constraints on the same sequence. If periods of these two constraints are of different 
length, assigning a value from the first one to one variable forbids assigning any value 
from the second one to multiple variables. For example, take SPAClNGl ({a, b} , 5, 4, X) 
and SPAClNGl ({c, d} , 7, 3, X) on the same sequence X of 21 variables. We start with 
full domains for all variables. After assign a to the first variable and filtering the do- 
mains are: 
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If we assign b to Xt, the assignment must be repeated every 5 variables. These repeti- 
tions occur on different places of the second period, so sufficient number of repetitions 
of c and d will not be possible. Subsequent filtering would remove circled values 
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and render the second constraint unsatisfiable. The same course of reasoning holds for 
assigning b to X 2 and X^. Thus only possibility is to assign b to X3, which leaves only 
two places free in the period of the second S PACING 1 , hence two symmetrical solutions. 

Formally, we first need to count a number of places that may contain a value from 
Si for each SpacingI constraint I. We denote it by U[ and define as 



ui = \{i\SinP}?Q,l<i<Pi}\ 

where P\ are the folded domains from the proof of Theorem |5]f or the constraint I. Now 
we need to count a number of places that are not blocked for values from Si 2 , but would 
become blocked after assigning a value from Si t to a variable Xi in the first period of 
l\ (i.e. 1 < i < pi x ). This count is denoted by 6/' 2 and defined as 

\{x I x = ((i+jph -l)mod piJ + 1,0 < j < k h ,i+jp h < k h pi 2 ,Si 2 nD(X x ) ^ 0}| 

for each pair of different voices li, I2 and 1 < i < pi ± . 

Theorem 7. Leth = SpacingI (Si ± , pi ± , ki t , X) and h = SpacingI (Si 2 ,pi 2 , h 2 ,X) 
be two constraints defined on the same sequence X with Si t d Si 2 = that are DC. 
Let ui 2 and 2 be defined as above. Ifb^' 2 > ui 2 — \Si 2 \ then there is no support for 
l\ A I2 with any value from Si t assigned to Xi within 1 < i < pi x . 

Proof. Suppose the premises of the theorem hold. The places in the first period of I2 
that may hold a value from Si 2 (together with places that already hold it) are called/ree. 
The rest of the places in the first period of I2 are called blocked. The number of the free 
places is ui 2 . Suppose that we assigned a value d\ € S^ to Xi. Then d\ has to repeat 
on the same place in each period of l\, that is places i+jp^ for each < j < ki 1 . Now 
no value from Si 2 can be put on those of these places that are constrained by I2, i.e. 
i + JPh ^ h 2 pi 2 for < j < ki x . So no value from Si 2 can repeat on whatever places 
in the period pi 2 these places are aligned with. Arbitrary index y into the sequence X 
is ((y — 1) mod pi 2 ) + 1-st place in a period of I2, so no value from Si 2 can be assigned 
to any variable with index x = ((i + jpi 1 — 1) mod pi 2 ) + 1 where i + jpi x < ki 2 pi 2 
for < j < k h . While l 2 is DC and 1 < x < p h , S h n D{X X ) = S h n P l x 2 . 
This restricts the places we are counting in b^ 1 ' 12 to those that were free, so fo' 1 ' 2 is the 
number of places that are blocked exclusively by assigning di to Xj. Finally, suppose 
fr' 1 ' 2 > ui 2 — \Si 2 \ holds. This is equivalent to ui 2 — < \Si 2 \ which means that 
the number of free places without the places blocked solely by d\ on the place i (the 
places left free after the assignment) is less than the number of values from Si 2 we need 
to assign to one period of l 2 . The constraint l 2 is obviously unsatisfiable in this case, so 
there is no support for l\ A I2 with any value d £ Si x assigned to Xi. □ 



9 Application 



One of the applications of CSP in music composition is the problem of Asynchronous 
rhythms described by Truchet H5I61 . Having h voices and a time horizon H, the goal is 
to construct one rhythmical pattern for each voice. The patterns are played repeatedly 
until the time horizon is reached. The pattern of voice I is of length pi and consists of 
mi onsets that have to be placed so that no two onsets are played at the same time. We 
consider also extension of this problem, where composer may want to impose additional 
constraints, such as forbidding or enforcing particular onsets on particular places, or 
restricting the density of the onsets e.g. at most one onset on two successive places. 

Truchet [5 6| formalized this problem in terms of variables Vi t i that denote time 
when an onset i is played in the pattern of voice I (so D(Vi t i) = {1, . . . ,pi}) and the 
following constraints: AllDifferent({V/ i1; . . . , Vi >mi }) for each voice I, and Vi 1 . ll + 
jiPh 7^ ^2,22 + j2Pi 2 f° r eacn P au " of different voices l\, h, each onset i±, 12 in each 
voice respectively and each j\, 32 s.t. Vi 1 .i 1 + j%pi < H and Vi 2 .i 2 + j-2.P1 < H. This, 
however, cannot be encoded as a CSP model, because if H mod pi ^ for some voice I 
then the range of ji in the later constraint depends on Vi.i for each onset i. We fix this by 
minor modification, where the pattern of each voice I is repeated exactly ki times and 
the indexes j\, j% in the last constraint are quantified by < j\ < ki x and < j'2 < h 2 ■ 
We will call this model OM. 

We encoded the problem into three new models. The first model (SM) uses one 
SpacingI (Si, pi, ki, X) for each voice I on the same sequence of variables X. The 
sets of values Si represent the onsets for each voice I, so they must be pairwise disjoint. 
The length of the pattern of voice / is pi and the number of its repetitions is fcj. Each 
variable Xj represents the onset played in the beat j, so D(Xj ) = Uj=i &t U {^} where 

Theorem 8. Enforcing DC on SM is strictly stronger than enforcing DC on OM. 

Proof. First we show that if every constraint in SM is DC, then also every constraint 
in OM is DC. 

Suppose every constraint in SM is DC. 

The first question is what is the correspondence between the two models in this case, 
i.e. if SM is DC, with what domains should we check OM for DC. The correspondence 
should be semantical. That is, for every placing of every onset whether it is possible to 
place other onsets so that each constraint of a model is satisfied. While all the S PACING 1 
constraints are DC, the domains restricted to their values are the same in each of their 
period. So the correspondence between the models can be described only on the first 
periods. It is d G D(Xi) where d G Si and 1 < i < pi iff i G D(Vi t d). 

If every constraint in SM is DC, then also every AllDifferent in OM is DC, 
because the bipartite graph in the propagator of S PACING 1 of voice / is a supergraph 
of AllDifferent for voice I in OM. So while the SpacingI is DC, all edges in 
its graph belong to some maximum matching. If we remove each node, we obtain a 
graph of the corresponding AllDifferent. Each of the edges in the new graph still 
belong to some maximum matching in it, because removing nodes preserves maximum 
matchings, because while each edge is in some maximum matching and each node is 



in at most one edge of a matching, each maximum matching loses exactly one edge per 
removed node. 

If every constraint in SM is DC, then also every difference constraint in OM is DC. 
The semantics of a difference constraint V/ x i j 1 + 3\Pi x ^ Vi 2 ,i 2 + ]2Pi 2 is that the onset 
ii in the j\ + 1-st period of the voice l\ cannot be played at the same time as the onset 12 
in the 32 + 1-st period of the voice h. If a difference constraint was not DC, this would 
mean that for some placing of the onset i\ it is not possible to place the onset 12 so that 
they would be played on different times in the j\ + 1-st and j 2 + 1-st periods of their 
voices respectively. This is possible only if the placing of i 2 is fixed (D(Vi 2t i 2 ) = {x}). 
This, however, means in the SM that the value of this onset has only one place where it 
can occur in one period of l 2 (12 £ D(X y ) iff y = x for 1 < y < pi 2 ), so the propagator 
of S PACING 1 for this voice would instantiate it and hence remove other values from the 
domain of the variable of the place (D{X X ) = {12})- This means for SPAClNGl of h 
instantiating the variable of the place to (D(Y^ ) = {0}), so assuming SM is DC we 
have contradiction with possibility of placing i\ so that the difference constraint cannot 
be satisfied (x Z^V^jJ because i\ g D(Y^) = {0}), so the difference constraint 
must be DC. 

To show strictness, consider instance of Asynchronous rhythms with two voices, first 
having m\ = p\ = k\ = 2 and the second one m 2 = 1, P2 = 3, k 2 = 2. Additionally, 
the very first place has to contain an onset of the first voice. Domains of variables of 
OM for this instance are D(V ha ) = D(V hb ) = {1,2} and D(V 2 , C ) = {2,3} and 
the constraints are AllDifferent({Vi, q , Vi^}) and V M ^ V 2 , c , Vi ti + 2 ^ V 2 , c , 
V\,i 7^ V2.C + 3, V\.i + 2 7^ V2.C + 3 for i G {a, b}. All the constraints of this model are 
DC. However, domains of variables of SM are D(X\) = {a, b} and D(X2) = . . . = 
D(X 6 ) = {a, b, c, 0} where Si = {a, 6} and S 2 = {c}. The propagation of SPAClNGl 
for the first voice will remove all values not in Si from D(X2) up to D(X4). This leaves 
the S PACING 1 for the second voice unsatisfiable. So SM determines that the instance 
is unsatisfiable without search while OM is not able to do so. □ 

Drawback of SM is that, due to interchangeability of onsets in one voice, it gener- 
ates a lot of symmetrical solutions. This observation led to the second model (SB). 
It is the same as the first one, except it uses a version of SPAClNGl that does not 
distinguish between different values in one voice, which is a piece of information a 
composer does not need when constructing rhythmical patterns. This new constraint, 
denoted SPAClNGss(d, m,p, k, X), can be defined as SPAClNGl (S,p, k, X) where S 
is a multiset of m values d. In other words, SPAClNGss((i, m,p, k, X) is satisfied iff 
there is exactly m occurrences of d in the first p places of X and this pattern is re- 
peated k times. Filtering algorithm of SPACING5B takes 0(n) time down a branch of 
a search tree as we can avoid finding the maximum matching step. After folding of the 
domains Pi = Hj=o D(Xj p+ i) for 1 < i < p, the filtering algorithm simply counts 
u = \{i I d e P.^ 1 < i < p}\ and v — \{i \ Pi = {d} ,1 < i < p}\. The algorithm 
fails iff u < m or v > m, if u = m it instantiates all with d £ D(X{) to d and 
if v — m it removes d from all domains that D{Xi) ^ {d}. Of course, the algorithm 
reflects the folded domains Pi back to the domains D(Xi) by removing d if d ^ Pi or 
instantiating to d if Pi = {d} and repeats all the instantiations and all the removals on 
the appropriate places in each period. 



Theorem 9. Enforcing DC on SPAClNGgs can be done in 0(n) time down a branch 
of the search tree. 

Proof. In order to show that the filtering algorithm described above really establishes 
DC, we need to show that it fails iff the constraint is unsatisfiable and if it does not fail 
the constraint is DC. 

Thanks to the folding of the domains, reflecting the folded domains back and repe- 
tition of instantiations and removals in each period, for the domains on the same places 
of each period ({D(Xj p+i ) < j < k} for each i) it holds that one contain d or 
is {d} iff all the others contain d or are {d} respectively. This allows us to restrict out 
reasoning only to one period. 

First, we show that the algorithm fails iff the constraint is unsatisfiable. After what 
is shown in the previous paragraph the only way how to falsify the constraint is to put 
either too many or too little values d to one period. Obviously, u < m means that there 
is not enough variables in one period with d in their domains and v > m means that 
d is assigned to too many variables in one period. So u < m or v > m holds iff the 
constraint is unsatisfiable, so the algorithm fails iff the constraints is unsatisfiable. 

Second, we show that if the algorithm removes a value from the domain of a vari- 
able, there is no solution of the constraint with this value assigned to this variable. 
Suppose the algorithm removes d from D(Xi). Except the cases considered in the first 
paragraph, the algorithm does so only when v = m and i is not one of the places 
counted in v. So if there was a solution with X, = d, the number of occurrences of d in 
one period would be m + 1, which is a contradiction. Suppose the algorithm removes 
d! 7^ d from D{X{). Except the cases considered in the first paragraph, the algorithm 
does so only when u = m and i is one of the places counted in u. So if there was a 
solution with Xi = d', the number of occurrences of d in one period would be m — 1, 
which is a contradiction. 

Finally, we show that if the constraint is satisfiable and there is no solution of the 
constraint with a value assigned to a variable, the algorithm removes the value from the 
domain of the variable. Suppose the constraint is satisfiable and there is no solution of 
the constraint with a value d' assigned to a variable JQ. Except the cases considered in 
the first paragraph, the constraint may have no solution only in two cases: The value d 
is assigned to too many or too little variables in one period. The constraint is satisfiable 
without the assignment Xi = d'. So the first case is possible only if there are exactly m 
variables in one period with d assigned to them and the assignment Xi = d! increases 
this number. This means that Xi is not instantiated to d yet, d' = d and v = m holds, 
so the algorithm will remove d from the domains of all the variables that are not instan- 
tiated to d yet, hence also from D(Xi). The second case is possible only if there are 
exactly m variables in one period with d in their domains and the assignment Xi = dl 
decreases this number. This means that Xi contains d in its domain, d' ^ d and u = m 
holds, so the algorithm will assign d to all the variables that contain d in their domains, 
hence remove d 1 from D(Xi). 

The algorithm runs in 0(n) time down a branch of the search tree, because the 
folded domains Pi are represented implicitly. At the beginning of the search, folding 
and reflecting back to the domains takes 0(pk), because it is done simply by, for every 
1 < i < p, removing d from each D(Xj p+i ), < j < k if it is missing in some and 



instantiating each Xj p+ i, < j < k to d if some already is instantiated. During the 
search it is sufficient just to update u and v upon removal of d or instantiation to d. On 
each update, the same action (removal or instantiation) has to be repeated in each of k 
period. Down a branch there will be at most p such updates, because after p updates the 
pattern of d-s in a period is fully determined. That makes 0(pk) steps down a branch. 
While pk < n, the filtering runs in 0(n) time down a branch of the search tree. □ 

The last model (SR) is the same as SM, but it is additionally making use of the 
incomplete filtering rule for S PACING/j (described in the previous section) between each 
pair of constraints. 



10 Experimental results 



To compare performance of the different models, we carried out a series of experiments 
on random instances of the Asynchronous rhythms problem. The instances were gen- 
erated for a fixed number of voices h, a mean length of the pattern of the first voice 
pi and a fixed number of repetitions of the last voice kh- The models were tested on 
ten instances for each tuple of (h,pi,kh). The generation of the instances followed the 
philosophy that the first voice should be the base voice with short pattern and small 
number of onsets and the other voices should have richer patterns. So the pattern of the 
second voice was 4± 1 beats longer that the pattern of the first voice and each following 
voice had pattern in average two times longer ±3 beats than two voices before. Num- 
bers of repetitions were set so that the voices overlap as much as possible. And the total 
number of the onsets was approximately 75% of the length of the sequence uniformly 
distributed between the voices. The experiments were run with 5 minute timeout and 
with a heuristic under which all models were performing better. The left side of Table|2] 
shows performance of the models on the basic Asynchronous rhythms problem. The 
right side of Table [2] shows performance of the models on the extended Asynchronous 
Rhythms problem, where the composer applies additional constraints that some onsets 
are forbidden in certain positions. We randomly removed 10% of values from the do- 
mains to model this restriction. SB was not tested on the extended problem, because in 
this case onsets are not interchangeable due to additional constraints on onsets. Experi- 
ments were run with CHOCO Solver 2. 1 .5 on Intel Xeon 3 CPU 2.0Ghz, 4GB RAM. 

As we can see in the results, SB solved almost all instances and, where comparable, 
it was the fastest and needed the least backtracks in solving basic model. SB is so suc- 
cessful because it removes symmetries and the filtering algorithm of SPAClNGgs runs 
in 0(n) down a branch. On the basic problem, SM is not obviously better than OM, 
however SM performs much better on the extended problem and it needs significantly 
less backtracks than OM. That supports the theory that SM achieves more propagation 
than OM. Finally, the additional rule significantly improves performance of SR against 
SM, especially the number of backtracks is lower by order of magnitude, which shows 
that the rule really facilitates propagation between the SPACING constraints. 



Table 2. Number of solved instances / average time to solve in sec / average number of backtracks 
in thousands. 
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11 Conclusions 



The global SPACING constraint is useful in modeling events that are distributed over 
time, like learning units scheduled over a study program or repeated patterns in music 
compositions. We have investigated theoretical properties of the constraint and shown 
that enforcing domain consistency (DC) is intractable even in very restricted cases. On 
the other hand we have identified two tractable restrictions and implemented efficient 
DC filtering algorithms for one of them. The algorithm takes 0(p 2 k + p 2 - 5 ) time down 
a branch of the search tree. We have also proposed an incomplete filtering algorithm 
for one of the intractable cases. We have experimentally evaluated performance of the 
algorithms on a music composition problem and demonstrated that our filtering algo- 
rithms outperform the state-of-the-art approach for solving this problem in both, speed 
and number of backtracks. 
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